智能论文笔记

Will there be a construction? Predicting road constructions based on heterogeneous spatiotemporal data

Amin Karimi Monsefi , Sobhan Moosavi , Rajiv Ramnath

分类：机器学习

2022-09-14

道路建设项目维护运输基础设施。这些项目的范围从短期（例如，重新铺面或固定坑洼）到长期（例如，添加肩膀或建造桥梁）。传统上，确定下一个建设项目是什么以及安排什么何时进行安排，这是通过人类使用特殊设备的检查来完成的。这种方法是昂贵且难以扩展的。另一种选择是使用计算方法来整合和分析多种过去和现在的时空数据以预测未来道路构建的位置和时间。本文报告了这种方法，该方法使用基于深神经网络的模型来预测未来的结构。我们的模型在由构造，天气，地图和道路网络数据组成的异质数据集上应用卷积和经常性组件。我们还报告了如何通过构建一个名为“美国建设”的大型数据集来解决我们如何解决足够的公开数据，其中包括620万个道路构造案例，并通过各种时空属性和路线网络功能增强，收集了。在2016年至2021年之间的连续美国（美国）中。使用对美国几个主要城市进行广泛的实验，我们显示了工作在准确预测未来建筑时的适用性 - 平均F1得分为0.85，准确性为82.2％ - 这是52.2％ - 胜过基线。此外，我们展示了我们的培训管道如何解决数据的空间稀疏性。

translated by 谷歌翻译

Scalable Deep Graph Clustering with Random-walk based Self-supervised Learning

Xiang Li , Dong Li , Ruoming Jin , Gagan Agrawal , Rajiv Ramnath

分类：机器学习

2021-12-31

基于Web的交互可以经常由归因图表示，并且在这些图中的节点聚类最近受到了很多关注。多次努力已成功应用图形卷积网络（GCN），但由于GCNS已被显示出遭受过平滑问题的GCNS的精度一些限制。虽然其他方法（特别是基于拉普拉斯平滑的方法）已经报告了更好的准确性，但所有工作的基本限制都是缺乏可扩展性。本文通过将LAPLACIAN平滑与广义的PageRank相同，并将随机步行基于算法应用为可伸缩图滤波器来解决这一打开问题。这构成了我们可扩展的深度聚类算法RWSL的基础，其中通过自我监督的迷你批量培训机制，我们同时优化了一个深度神经网络，用于采样集群分配分配和AutoEncoder，用于群集导向的嵌入。使用6个现实世界数据集和6个聚类指标，我们表明RWSL实现了几个最近基线的结果。最值得注意的是，我们显示与所有其他深度聚类框架不同的RWSL可以继续以超过一百万个节点的图形扩展，即句柄。我们还演示了RWSL如何在仅使用单个GPU的18亿边缘的图表上执行节点聚类。

translated by 谷歌翻译

Multiscale Graph Neural Networks for Protein Residue Contact Map Prediction

Kuang Liu , Rajiv K. Kalia , Xinlian Liu , Aiichiro Nakano , Ken-ichi Nomura , Priya Vashishta , Rafael Zamora-Resendizc

分类：人工智能 | 机器学习

2022-12-02

Machine learning (ML) is revolutionizing protein structural analysis, including an important subproblem of predicting protein residue contact maps, i.e., which amino-acid residues are in close spatial proximity given the amino-acid sequence of a protein. Despite recent progresses in ML-based protein contact prediction, predicting contacts with a wide range of distances (commonly classified into short-, medium- and long-range contacts) remains a challenge. Here, we propose a multiscale graph neural network (GNN) based approach taking a cue from multiscale physics simulations, in which a standard pipeline involving a recurrent neural network (RNN) is augmented with three GNNs to refine predictive capability for short-, medium- and long-range residue contacts, respectively. Test results on the ProteinNet dataset show improved accuracy for contacts of all ranges using the proposed multiscale RNN+GNN approach over the conventional approach, including the most challenging case of long-range contact prediction.

translated by 谷歌翻译

Scalable Pathogen Detection from Next Generation DNA Sequencing with Deep Learning

Sai Narayanan , Sathyanarayanan N. Aakur , Priyadharsini Ramamurthy , Arunkumar Bagavathi , Vishalini Ramnath , Akhilesh Ramachandran

分类：机器学习

2022-11-30

Next-generation sequencing technologies have enhanced the scope of Internet-of-Things (IoT) to include genomics for personalized medicine through the increased availability of an abundance of genome data collected from heterogeneous sources at a reduced cost. Given the sheer magnitude of the collected data and the significant challenges offered by the presence of highly similar genomic structure across species, there is a need for robust, scalable analysis platforms to extract actionable knowledge such as the presence of potentially zoonotic pathogens. The emergence of zoonotic diseases from novel pathogens, such as the influenza virus in 1918 and SARS-CoV-2 in 2019 that can jump species barriers and lead to pandemic underscores the need for scalable metagenome analysis. In this work, we propose MG2Vec, a deep learning-based solution that uses the transformer network as its backbone, to learn robust features from raw metagenome sequences for downstream biomedical tasks such as targeted and generalized pathogen detection. Extensive experiments on four increasingly challenging, yet realistic diagnostic settings, show that the proposed approach can help detect pathogens from uncurated, real-world clinical samples with minimal human supervision in the form of labels. Further, we demonstrate that the learned representations can generalize to completely unrelated pathogens across diseases and species for large-scale metagenome analysis. We provide a comprehensive evaluation of a novel representation learning framework for metagenome-based disease diagnostics with deep learning and provide a way forward for extracting and using robust vector representations from low-cost next generation sequencing to develop generalizable diagnostic tools.

translated by 谷歌翻译

User-Entity Differential Privacy in Learning Natural Language Models

Phung Lai , NhatHai Phan , Tong Sun , Rajiv Jain , Franck Dernoncourt , Jiuxiang Gu , Nikolaos Barmpalios

分类：自然语言处理 | 机器学习

2022-11-01

In this paper, we introduce a novel concept of user-entity differential privacy (UeDP) to provide formal privacy protection simultaneously to both sensitive entities in textual data and data owners in learning natural language models (NLMs). To preserve UeDP, we developed a novel algorithm, called UeDP-Alg, optimizing the trade-off between privacy loss and model utility with a tight sensitivity bound derived from seamlessly combining user and sensitive entity sampling processes. An extensive theoretical analysis and evaluation show that our UeDP-Alg outperforms baseline approaches in model utility under the same privacy budget consumption on several NLM tasks, using benchmark datasets.

translated by 谷歌翻译

Collisionless Pattern Discovery in Robot Swarms Using Deep Reinforcement Learning

Nelson Sharma , Aswini Ghosh , Rajiv Misra , Supratik Mukhopadhyay , Gokarna Sharma

分类：机器人

2022-09-20

我们提出了一个基于强化的学习框架，用于自动发现在脂肪机器人群的任何初始配置中可用的模式。特别是，我们对脂肪机器人群中无碰撞收集和相互可见性的问题进行了建模，并发现使用我们的框架来解决它们的模式。我们表明，通过根据某些约束（例如相互可见性和安全接口）来塑造奖励信号，机器人可以发现无碰撞的轨迹，导致形成良好的聚集和可见性模式。

translated by 谷歌翻译

Combining Compressions for Multiplicative Size Scaling on Natural Language Tasks

Rajiv Movva , Jinhao Lei , Shayne Longpre , Ajay Gupta , Chris DuBois

分类：自然语言处理

2022-08-20

量化，知识蒸馏和修剪是NLP中神经网络压缩的最流行方法之一。独立地，这些方法降低了模型的大小并可以加速推断，但是尚未严格研究它们的相对益处和组合相互作用。对于这些技术的八个可能子集中的每一个，我们比较了六个BERT体系结构和八个胶水任务的准确性与模型大小的权衡。我们发现量化和蒸馏始终比修剪更大的好处。出乎意料的是，除了将多种方法一起使用多种修剪和量化之外，很少会产生回报的减少。取而代之的是，我们观察到互补和超级义务减少了模型大小。我们的工作定量表明，结合压缩方法可以协同降低模型大小，并且从业者应优先考虑（1）量化，（2）知识蒸馏，（3）修剪以最大程度地提高准确性与模型大小的权衡。

translated by 谷歌翻译

Persuasion Strategies in Advertisements: Dataset, Modeling, and Baselines

Yaman Kumar Singla , Rajat Jha , Arunim Gupta , Milan Aggarwal , Aditya Garg , Ayush Bhardwaj , Tushar , Balaji Krishnamurthy , Rajiv Ratn Shah , Changyou Chen

分类：自然语言处理 | 计算机视觉

2022-08-20

建模是什么使广告有说服力的原因，即引起消费者的所需响应，对于宣传，社会心理学和营销的研究至关重要。尽管其重要性，但计算机视觉中说服力的计算建模仍处于起步阶段，这主要是由于缺乏可以提供与ADS相关的说服力标签的基准数据集。由社会心理学和市场营销中的说服文学的激励，我们引入了广泛的说服策略词汇，并建立了用说服策略注释的第一个AD图像语料库。然后，我们通过多模式学习制定说服策略预测的任务，在该任务中，我们设计了一个多任务注意融合模型，该模型可以利用其他广告理解的任务来预测说服策略。此外，我们对30家财富500家公司的1600个广告活动进行了真实的案例研究，我们使用模型的预测来分析哪些策略与不同的人口统计学（年龄和性别）一起使用。该数据集还提供图像分割掩码，该蒙版在测试拆分上标记了相应的AD图像中的说服力策略。我们公开发布代码和数据集https://midas-research.github.io/persuasion-avertisements/。

translated by 谷歌翻译

Temporal View Synthesis of Dynamic Scenes through 3D Object Motion Estimation with Multi-Plane Images

Nagabhushan Somraj , Pranali Sancheti , Rajiv Soundararajan

分类：计算机视觉

2022-08-19

可以通过定期预测未来的框架以增强虚拟现实应用程序中的用户体验，从而解决了低计算设备上图形渲染高帧速率视频的挑战。这是通过时间视图合成（TVS）的问题来研究的，该问题的目标是预测给定上一个帧的视频的下一个帧以及上一个和下一个帧的头部姿势。在这项工作中，我们考虑了用户和对象正在移动的动态场景的电视。我们设计了一个将运动解散到用户和对象运动中的框架，以在预测下一帧的同时有效地使用可用的用户运动。我们通过隔离和估计过去框架的3D对象运动，然后推断它来预测对象的运动。我们使用多平面图像（MPI）作为场景的3D表示，并将对象运动作为MPI表示中相应点之间的3D位移建模。为了在估计运动时处理MPI中的稀疏性，我们将部分卷积和掩盖的相关层纳入了相应的点。然后将预测的对象运动与给定的用户或相机运动集成在一起，以生成下一帧。使用不合格的填充模块，我们合成由于相机和对象运动而发现的区域。我们为动态场景的电视开发了一个新的合成数据集，该数据集由800个以全高清分辨率组成的视频组成。我们通过数据集和MPI Sintel数据集上的实验表明我们的模型优于文献中的所有竞争方法。

translated by 谷歌翻译

Large vocabulary speech recognition for languages of Africa: multilingual modeling and self-supervised learning

Sandy Ritchie , You-Chi Cheng , Mingqing Chen , Rajiv Mathews , Daan van Esch , Bo Li , Khe Chai Sim

分类：自然语言处理

2022-08-05

在非洲使用的2,000多种语言几乎都没有广泛可用的自动语音识别系统，并且所需的数据也仅适用于几种语言。我们已经尝试了两种技术，这些技术可能为非洲语言提供大型词汇识别的途径：多语言建模和自我监督学习。我们收集了可用的开源数据并收集了15种语言的数据，并使用这些技术训练了实验模型。我们的结果表明，汇总多语言端到端模型中可用的少量数据，并预先培训无监督的数据可以帮助提高许多非洲语言的语音识别质量。

translated by 谷歌翻译